Learning and Fusing Multi-View Code Representations for Function Vulnerability Detection
Abstract
The explosive growth of vulnerabilities poses a significant threat to the security of software systems. While various deep-learning-based vulnerability detection methods have emerged, they primarily rely on semantic features extracted from a single code representation structure, which limits their ability to detect vulnerabilities hidden deep within the code. To address this limitation, we propose S2FVD, short for Sequence and Structure Fusion-based Vulnerability Detector, which fuses vulnerability-indicative features learned from multiple views of the code for more accurate vulnerability detection. Specifically, S2FVD employs either well-matched or carefully extended neural network models to extract vulnerability-indicative semantic features from the token sequence, attributed control flow graph (ACFG), and abstract syntax tree (AST) representations of a function, respectively. These features capture different perspectives of the code and are then fused to enable S2FVD to accurately detect vulnerabilities that are well hidden within a function. Experiments conducted on two large vulnerability datasets demonstrated the superior performance of S2FVD against state-of-the-art approaches, with its accuracy and F1 scores reaching 98.07% and 98.14%, respectively, in detecting the presence of vulnerabilities, and 97.93% and 97.94%, respectively, in pinpointing the specific vulnerability types. Furthermore, on the real-world dataset D2A, S2FVD achieved average performance gains of 6.86% and 14.84% in terms of the accuracy and F1 metrics, respectively, over the state-of-the-art baselines. An ablation study also confirms the superiority of fusing the semantics implied in multiple distinct code views to further enhance vulnerability detection performance.
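The idea of fusing per-view features can be illustrated with a minimal sketch. The abstract does not state how S2FVD combines its three views, so the code below assumes the simplest option, concatenation followed by a logistic layer. The names `fuse_views` and `predict_vulnerable`, the vector sizes, and the weights are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence


def fuse_views(seq_vec: Sequence[float],
               acfg_vec: Sequence[float],
               ast_vec: Sequence[float]) -> List[float]:
    """Fuse three per-view feature vectors by concatenation.

    Assumption: concatenation is only one possible fusion strategy;
    the abstract does not say which one S2FVD actually uses.
    """
    return [*seq_vec, *acfg_vec, *ast_vec]


def predict_vulnerable(fused: Sequence[float],
                       weights: Sequence[float],
                       bias: float = 0.0) -> float:
    """Score a fused vector with a logistic layer (hypothetical classifier).

    Returns a value in (0, 1) that can be read as a vulnerability score.
    """
    if len(fused) != len(weights):
        raise ValueError("fused vector and weights must have the same length")
    logit = sum(x * w for x, w in zip(fused, weights)) + bias
    return 1.0 / (1.0 + math.exp(-logit))


if __name__ == "__main__":
    # Toy embeddings standing in for the outputs of the three view encoders.
    token_features = [0.2, 0.8]
    acfg_features = [0.5]
    ast_features = [0.1, 0.9]

    fused = fuse_views(token_features, acfg_features, ast_features)
    # Illustrative, untrained weights; a real model would learn these.
    score = predict_vulnerable(fused, weights=[0.4, -0.3, 0.7, 0.1, 0.2])
    print(f"fused length = {len(fused)}, score = {score:.3f}")
```

In a sketch like this, each encoder stays independent, so a view can be dropped or replaced without touching the others, which is also what makes an ablation over views straightforward to run.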
Related resources
Multi-View Learning for Web Spam Detection
Spam pages are designed to maliciously appear among the top search results by excessive usage of popular terms. Therefore, spam pages should be removed using an effective and efficient spam detection system. Previous methods for web spam classification used several features from various information sources (page contents, web graph, access logs, etc.) to detect web spam. In this paper, we follo...
Multi-View Perceptron: a Deep Model for Learning Face Identity and View Representations
Various factors, such as identity, view, and illumination, are coupled in face images. Disentangling the identity and view representations is a major challenge in face recognition. Existing face recognition systems either use handcrafted features or learn features discriminatively to improve recognition accuracy. This is different from the behavior of the primate brain. Recent studies [5, 19] disco...
Statistical Learning of Multi-view Face Detection
A new boosting algorithm, called FloatBoost, is proposed to overcome the monotonicity problem of the sequential AdaBoost learning. AdaBoost [1, 2] is a sequential forward search procedure using the greedy selection strategy. The premise offered by the sequential procedure can be broken down when the monotonicity assumption, i.e. that when adding a new feature to the current set, the value of the...
Learning Probabilistic Distribution Model for Multi-View Face Detection
Modeling subspaces of a distribution of interest in high-dimensional spaces is a challenging problem in pattern analysis. In this paper, we present a novel framework for pose-invariant face detection through multi-view face distribution modeling. The approach aims to learn a set of low-dimensional subspaces from an originally nonlinear distribution by using the mixtures of probabilistic PCA...
Tight frame approximation for multi-frames and super-frames
In this thesis, a generator for a multi-frame or super-frame generated under the action of a projective unitary representation of a discrete countable group is studied. Examples of such frames include Gabor multi-frames, Gabor super-frames, and frames for shift-invariant subspaces. We show that there exists a unique normalized tight multi-frame (super-frame) generator that has the minimum distance from it. Similar problems for dual frames are also posed, and some ...
Journal
Journal title: Electronics
Year: 2023
ISSN: 2079-9292
DOI: https://doi.org/10.3390/electronics12112495